“Chunking” spoken language: Introducing weak cesuras
نویسندگان
چکیده
Abstract In this introductory paper to the special issue on “Weak cesuras in talk-in-interaction”, we aim guide reader into current work “chunking” of naturally occurring talk. It is conducted methodological frameworks Conversation Analysis and Interactional Linguistics – two approaches that consider interactional aspect humans talking with each other be a crucial starting point for its analysis. doing so, will (1) lay out background (what problematic about talk-in-interaction, characteristics approach chosen by contributors, cesura model), (2) highlight what can gained from such revised understanding talk-in-interaction referring previous model as well findings contributions issue, (3) indicate further directions could take papers issue. We hope induce fruitful exchange phenomena discussed, across divides.
منابع مشابه
Chunking Clinical Text Containing Non-Canonical Language
Free text notes typed by primary care physicians during patient consultations typically contain highly non-canonical language. Shallow syntactic analysis of free text notes can help to reveal valuable information for the study of disease and treatment. We present an exploratory study into chunking such text using offthe-shelf language processing tools and pre-trained statistical models. We eval...
متن کاملSouth African Language Resources: Phrase Chunking
Phrase chunking remains an important natural language processing (NLP) technique for intermediate syntactic processing. This paper describes the development of protocols, annotated phrase chunking data sets and automatic phrase chunkers for ten South African languages. Various problems with adapting the existing annotation protocols of English are discussed as well as an overview of the annotat...
متن کاملWeak Semi-Markov CRFs for NP Chunking in Informal Text
This paper introduces a new annotated corpus based on an existing informal text corpus: the NUS SMS Corpus (Chen and Kan, 2013). The new corpus includes 76,490 noun phrases from 26,500 SMS messages, annotated by university students. We then explored several graphical models, including a novel variant of the semi-Markov conditional random fields (semi-CRF) for the task of noun phrase chunking. W...
متن کاملthe role of thematic structure in comprehending spoken language
in fact this study is concerned with the relationship between the variation in thematice structure and the comprehension of spoken language. so the study focused on the following questions: 1. is there any relationship between thematic structure and the comprehension of spoken language? 2. which of the themes would have greated thematic force and be easier for the subjects to comprehend? accord...
15 صفحه اولSpoken Language Systems II
The papers in this session were concerned with higher-level processing in speech recognition systems and, in some cases, the interface between the speech-recognition and natural-language components of a spoken language system. The session consisted of talks from four DARPA sites, Dragon Systems, SRI International, BBN Systems and Technologies, and MIT Lincoln Laboratory. These talks were follow...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Open Linguistics
سال: 2021
ISSN: ['2300-9969']
DOI: https://doi.org/10.1515/opli-2020-0173